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Prdm4 is a highly conserved member of the Prdm family of PR/SET domain zinc finger proteins. Many well-studied Prdm family 
members play critical roles in development and display striking loss-of-function phenotypes. Prdm4 functional contributions 
have yet to be characterized. Here, we describe its widespread expression in the early embryo and adult tissues. We demonstrate 
that DNA binding is exclusively mediated by the Prdm4 zinc finger domain, and we characterize its tripartite consensus se- 
quence via SELEX (systematic evolution of ligands by exponential enrichment) and ChlP-seq (chromatin immunoprecipitation- 
sequencing) experiments. In embryonic stem cells (ESCs), Prdm4 regulates key pluripotency and differentiation pathways. Two 
independent strategies, namely, targeted deletion of the zinc finger domain and generation of a EUCOMM LacZ reporter allele, 
resulted in functional null alleles. However, homozygous mutant embryos develop normally and adults are healthy and fertile. 
Collectively, these results strongly suggest that Prdm4 functions redundantly with other transcriptional partners to coopera- 
tively regulate gene expression in the embryo and adult animal. 



PRDM family members share a characteristic modular structure 
with the N-terminal PR/SET domain and a variable number of 
C-terminal zinc finger (ZF) repeats (1-3). Gene targeting experi- 
ments have revealed critical roles as ceU-type-specific transcriptional 
regulators of mouse development. For example, targeted loss of 
Prdml disrupts germ cell specification, foreKmb patterning, placental 
morphogenesis, and postnatal reprogramming of intestinal entero- 
cytes (4-9). Similarly, Prdml4 plays an essential role in specification 
of the germ cell lineage (10). Prdm 16 promotes brown fat identity 
and represses the alternative skeletal muscle cell fate (11). Interest- 
ingly, a Prdm i 6 splice site mutation that creates a hypomorphic allele 
causes craniofacial skeletal defects (12). Prdm3 and Prdm 16 regulate 
maintenance of hematopoietic stem cell function (13, 14). Prdm9 
governs meiotic recombination (15-17). Deregulated expression of 
several Prdm family members, including PRDMl, PRDM2/RIZ1, 
PRDM3, PRDM5, PRDM14, and PRDM16, has been implicated in 
human cancers (1,3). 

The evolutionarily conserved PR/SET domain is 20 to 30% 
identical to the SET domain found in numerous histone lysine 
methyltransferases (HMTs) that modify nucleosome structure 
(18, 19). Several Prdms, including Prdm3, Prdm9, and Prdml6, 
act directly as histone H3 methyltransferases (1,3, 20), while oth- 
ers seem to lack intrinsic catalytic activity. Rather, their ability to 
regulate target gene expression is probably mediated via recruit- 
ment of epigenetic partners such as G9a, histone deacetylases 
(HDACs), Lsdl, and the arginine methyltransferase PrmtS (21). 

The zinc finger (ZF) repeats typically function to mediate nu- 
clear import and sequence-specific DNA binding (1, 3, 21). 
Unique consensus binding motifs recognized by several family 
members have been previously characterized. For example, 
Prdm9 recognizes a 13-mer motif enriched at recombination hot 
spots (22). Consistent with its proposed role in osteogenic differ- 
entiation (23), PrdmS predominantly binds within the exonic re- 
gions of collagen genes in association with RNA polymerase II. 



Prdml4 binding sites in embryonic stem cells (ESCs) substantially 
overlap those occupied by Nanog and Oct4, in keeping with its 
ability to protect ESCs from entering extraembryonic endoderm 
fates and ensure pluripotency (24). Blimpl/Prdml directly si- 
lences the promoter regions of key transcription factors such as 
c-myc, Pax5, and CIITA expressed in B lymphocytes to dramati- 
cally shift the developmental program toward a terminal plasma 
cell fate (25). Similarly, Prdml directly represses c-myc expression 
in macrophages and sebaceous gland progenitors to arrest cell 
cycle progression, but c-myc is not a target in T cells (26-28). 
Rather, Prdml silences expression of an essential cytokine, inter- 
leukin-2 (IL-2), necessary for T cell proliferation (29). A distinct 
set of Blimpl/Prdml transcriptional targets has been character- 
ized in the skin epidermis (30). Genome-wide ChlP-chip (chro- 
matin immunoprecipitation with microarray technology) exper- 
iments identified an extended set of Blimpl/Prdml-occupied 
promoters in human myelomas (31). Collectively, these experi- 
ments suggest that a dynamic process governs ceU-type-specific 
Blimpl/Prdml target site selection within the nucleus. 
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Considerably less is known about transcriptional targets and 
the functional roles played by other Prdm family members. In 
particular, Prdm4 was originally identified in a yeast two-hybrid 
screen as a factor that interacts with the p75 neurotrophin recep- 
tor and displays a dynamic pattern of expression in the developing 
nervous system (32-34). A recent report suggests that Prdm4 con- 
trols proliferation and differentiation in neural stem cells (33). 
Prdm4 has a long N-terminal domain including a zinc knuckle, 
followed by the PR/SET domain and six C2H2 ZFs (32, 35). As for 
other PRDMs, Prdm4 interacts with HDACs (36), as well as PrmtS 
(33), and its zinc fingers mediate nuclear import (36). However, 
developmentally regulated expression of Prdm4 outside the cen- 
tral nervous system and characterization of a loss-of-function al- 
lele have yet to be reported. 

Here, we describe widespread Prdm4 expression during early 
mouse development and throughout adult tissues. The strongest 
expression was detectable in reproductive tissues. To characterize 
the Prdm4 DNA binding consensus motif, we performed system- 
atic evolution of ligands by exponential enrichment (SELEX) ex- 
periments. To assess Prdm4 occupancy in mouse embryonic stem 
cells (ESCs), we performed a genome-wide screen via chromatin 
immunoprecipitation followed by high-throughput sequencing 
(ChlP-seq). Interestingly, Prdm4 displays a marked bias toward 
binding proximaUy to transcription start sites (TSSs). As expected, 
targeted deletion of the zinc finger domain (ZFD) encoded by 
exons 9 to 11 disrupts nuclear import and DNA binding. Expres- 
sion microarray experiments reveal significant changes in Prdm4- 
dependent transcriptional profiles in ESCs consistent with a func- 
tional role in governing pluripotency and differentiation. 
Unexpectedly, however, homozygous mutant embryos develop 
normally and adults are healthy and fertile. We conclude that 
Prdm4 influences the core regulatory circuitry in cultured ESCs 
but plays a nonessential role in vivo. 

MATERIALS AND METHODS 

ISH and histology. Embryonic day 6.5 (E6.5) to E9.5 embryos were fixed 
with 4% paraformaldehyde (PFA) overnight at 4°C, and whole-mount in 
situ hybridization (ISH) analysis was carried out according to standard 
protocols (37). Prdm4 riboprobes spanning nucleotides (nt) 3081 to 3506 
of the sequence with accession no. NM_181650 (kindly provided by 
Adrian Moore, RIKEN Brain Science Institute, Wako, Saitama, lapan) 
and the full-length cDNA (IMAGE clone 8862431) were used. For histol- 
ogy, embryos were postfixed in 4% PFA, dehydrated through an ethanol 
series, embedded in paraffin, sectioned at 8 |j.m, and eosin counterstained. 
For section ISH, tissue was fixed overnight in 4% paraformaldehyde, de- 
hydrated through an ethanol series, embedded in paraffin, sectioned at 7 
to 8 jjim, and processed according to standard protocols. Testes and ova- 
ries were fixed overnight in 4% PFA, dehydrated in ethanol, embedded in 
paraffin, sectioned at 8 |jLm, and stained with hematoxylin and eosin. 

Northern blot analysis. Total RNA was extracted from individual 
organs of 4-week-old C57BL/6J mice using the TRIzol method (Gibco/ 
BRL). Total RNA (10 jjug/lane) was size fractionated on a 1% agarose- 
formaldehyde gel, transferred onto Hybond N membranes (GE Health- 
care), and probed with a '^P-random-primed full-length cDNA Prdm4 
fragment. 

Gene targeting. The Prdm4^^^ targeting vector was generated by li- 
gating a 6.9-kb 5' homology region (EcoRI-XhoI) and a 4.6-kb 3' homol- 
ogy region (Afel-EcoRV) from the bMQ362d07 bacterial artificial chro- 
mosome (BAG) (Source Bioscience, Gambridge, United Kingdom) and 
the /oxP-flanked pglc-iiygromycin cassette (2.1 kb) (38) into a modified 
version of pBSII-KS( — ) (Stratagene). The hsv-thymidine Icinase (hsv-tk) 
cassette was added outside the 3' homology region. A Notl-linearized 



targeting vector (15 |jLg) was introduced into GGE ESGs by electropora- 
tion. Drug-resistant colonies selected in the presence of hygromycin (1.5 
|jLg/ml) and l-[2'-deoxy-2'-fluoro-P-D-arabinofuranosyl]-5-iodouracil 
(FIAU) (0. 1 [jig/ml) were screened by Southern blotting using the restric- 
tion enzyme and probe combinations shown in Fig. 5. For excision of the 
loxP-flanked pgk-hygromycin cassette, correctly targeted Prdm4^''^^^ 
clones were transiently transfected with pMClCre and subsequently 
screened by Southern blotting. Three excised clones were retargeted to 
generate doubly targeted Prdm4'^^^'^^ clones. For biochemical studies, 
wild-type and Prdm4'^^^^^^ clones were adapted to grow under feeder- 
free conditions on gelatin-coated plates in medium containing 1,000 
U/ml of leukemia inhibitory factor (LIE). To generate animals carrying 
the targeted allele, G57BL/6I blastocysts were injected with 12 to 14 
Prdm4*'^^ ESGs and transferred into E2.5 pseudopregnant foster fe- 
males. The following primers and cycling conditions were used for PGR 
genotyping to distinguish the wild-type and AZF alleles: common primer, 
TGG TTA GAG AGG GTA TGG TAT GA; wild-type primer, GGG GAG 
GAA ATT GTG TTG TTG A; mutant primer, GAT GGT GAG GTA GAG 
GGA AGA; 60°G annealing temperature, 40 cycles. The Prdm4^^ 
(Prdm4EUG0MM) targeting vector (project ID 45696) was obtained 
from Helmholtz Zentrum Miinchen Deutsches Forschungszentrum fur 
Gesundheit und Umwelt (GmbH), Germany. AsiSI-linearized targeting 
vector (15 |JLg) was introduced by electroporation into GGE ESGs, and 
clones were selected in the presence of G418 (200 ixg/ml). Drug-resistant 
colonies were screened by Southern blotting using the restriction enzyme 
and probe combinations shown in Fig. 9A to identify correctly targeted 
clones (see Fig. 9B). Gorrectly targeted Prdm4^'^^ ESGs were used to 
generate germ line chimeras, and the resulting heterozygous animals were 
intercrossed to obtain homozygous mice for tissue analysis. The Prdm4^^ 
allele was excised and converted into the null Prdm4^'"^ allele by crossing 
heterozygous males with Sox2.Cre female carriers (39). The following 
primers and cycling conditions were used for PGR genotyping: for the 
Prdm4'^'^ allele, common primer, GGG AGA GGG ATG AGT AGG TT; 
wild-type primer, GGA GGT TGT AGG TGG GGT AA; mutant primer, 
AAA GGA ATA GGA TGA GAA ATT TGA; 58°G annealing temperature, 
40 cycles; for the Prdm4'-''^^ allele, common primer, AAG TGG ATG AGT 
TTA TGG GGT A; wild-type primer, AGA TTT GTG GGG GGA GTT TT; 
mutant primer, AAA GGA ATA GGA TGA GAA ATT TGA. All animal 
experiments were performed in accordance with Home Office regula- 
tions. 

Generation of stably transfected ESCs expressing GFP-epitope- tagged 
full-length and AZF Prdm4. The full-length Prdm4 coding sequence was 
PGR amplified fi-om FANT0M3 clone 402240 1E08 using primers GATAGA 
ATTGAGGATGAATGAGATGAAGTTGAGG and TATGGGGGGGTTT 
ATGTGGGGAGAGAGAGTG to introduce EcoRI and Smal restriction 
sites. Alternatively, to generate the pGAGGS-Prdm4AZF-EGFP (en- 
hanced green fluorescent protein) expression vector, the Prdm4^^ cod- 
ing sequences were cloned from doubly targeted ESGs via reverse tran- 
scription-PGR (RT-PGR). The PGR products were cloned into 
complementary sites of pEGFP-N2 (Glontech), subsequently excised us- 
ing Xhol and NotI, and inserted into a modified version of pGAGGS (40) 
containing an internal ribosome entry site (IRES) puromycin resistance 
cassette. The Sall-linearized vectors were electroporated into gelatin- 
adapted GGE or Prdm4'^^^^^^ ESGs, and puromycin-resistant clones 
were screened for EGFP expression by flow cytometry and Western blot 
analysis. 

SELEX. To generate the recombinant Prdm4 zinc flnger domain, the 
coding sequence was PGR amplifled from IMAGE clone (ID 8862431) 
using the primers Forward (GAT AGG ATG GGA TGG GGG AAG GGA 
GAG GAA GGA AAG G) and Reverse (GAT AGT GGA GTT ATT AGG 
AGG TGG GGT GTT TGG AGG TGT TGA G) and cloned into Xhol and 
BamHI sites of modified pET28a (Novagen). The bacterially expressed 
recombinant protein was induced with 0. 1 mM isopropyl-ji-D-thioga- 
lactopyranoside (IPTG) and purified, and systematic evolution of li- 
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gands by exponential enrichment (SELEX) was performed as previ- 
ously described (41). 

Cell fractionation and Western blotting. Total cell lysates were pre- 
pared and Western blot analysis was performed as described previously 
(41) using a mouse anti-GFP monoclonal antibody, JL-8 (Clontech; cat- 
alog no. 632381; 1:1,000). Alternatively, isolation of nuclear and cytoplas- 
mic fractions for Western blot analysis was performed as described previ- 
ously (42). 

EMSA. Electrophoretic mobility shift assays (EMSAs) were performed 
as described previously (41). Nuclear complexes were resolved in 0.8% 
agarose gels, or alternatively, 5% acrylamide gels were used for analysis of 
His-tagged purified Prdm4. Oligonucleotides used for EMSA probes were 
as follows: for wild-type Bahccl, GGC CTG GGT CGG CCC GCG GGG 
ATC CTG GAA ACC GTC CCC GGT TTA TCT CCT T and GGA AGG 
AGA TAA ACC GGG GAC GGT TTC GAG GAT CCC GGC GGG CCC 
ACC GAG G; for AGAAAC Bahccl, GGC CTG GGT CGG CCC GCG 
GGG ATC CTG TCC CAC GTC CCC GGT TTA TCT CCT T and GGA 
AGG AGA TAA ACC GGG GAC GTC GGA GAG GAT CCC GGC GGG 
CCC ACC CAG G; for ACT Bahccl, GGC CTG GGT CGG CCC GCG 
GGG ATC AGG GAA ACC GTC CCC GGT TTA TCT CCT T and GGA 
AGG AGA TAA ACC GGG GAC GGT TTC CCT GAT CCC CCC GGG 
CCC ACC CAG G; for AGGGG Bahccl, GGC CTG GGT CGG CCC GGT 
TTT ATC CTG GAA ACC GTC CCC GGT TTA TCT CCT T and GGA 
AGG AGA TAA ACC GGG GAC GGT TTC CAG GAT AAA AGC GGG 
CCC ACC CAG G; for AGGGG/ ACT Bahccl, GGC CTG GGT CGG CCC 
GCT TTT ATC AGG GAA ACC GTC CCC GGT TTA TCT CCT T and 
GGA AGG AGA TAA ACC GGG GAC GGT TTC CCT GAT AAA AGC 
GGG CCG ACC CAG G. Antibody supershifting was performed by the 
addition of 0.1 fjig of anti-Prdm4 rabbit polyclonal antibody (Sigma- Al- 
drich; HPA024322) or 1 |jLg anti-GFP mouse monoclonal antibody 3E6 
(Invitrogen; Al 1 120) to the binding reaction mixtures. 

Chromatin immunoprecipitation and deep sequencing (ChlP-seq). 
Stably transfected ESC clones expressing fuU-length Prdm4-EGFP (FL- 
Prdm4-EGFP) (2 X 10^ to 3 X lO' cells for each sample) were subjected to 
Chip using either 6 ixg of anti-GFP monoclonal antibody (Invitrogen; 
All 120, clone 3E6, IgG2a) or control mouse IgG (Santa Cruz; sc-2025, 
mouse IgG) as described previously (43). The resulting DNA samples 
were multiplexed and sequenced using two lanes on an lUumina HiSeq 
2000 sequencer. 

Whole-genome ChlP-seq analysis. Sequence reads were mapped to 
the mm9 mouse genome release with Stampy using default parameters 
(44). Peak calling was performed using MACS2 (45, 46), using default 
parameters to call areas of enrichment in the anti-GFP ChIP over the 
nonspecific whole-mouse IgG control ChlP. Overlapping peaks with a 
score of > 10 in all four replicates were identified, and the core region of 
overlap was used for further analyses. The genomic distribution of ChlP- 
seq peaks compared to gene annotations was determined using CEAS (47). 
Genes of Ensembl release 67 with proximal Prdm4 binding were identified 
using custom Perl scripts. De novo motif finding within ChlP-seq peaks was 
performed using MEME (48). ChlP-seq peak coordinates were compared to 
Ensembl Regulatory Features of release 67 and ERV1:RLTR23 regions using 
custom Perl scripts. Functional annotation of Prdm4 ChlP-seq peaks was 
performed using GREAT version 2.0.2 using the basal plus extension rule, 
annotating genes within 5 kb of transcription start sites initially and within 1 
Mb where no proximal gene exists (49). Terms with a binomial P value 
of SI X 10^^ were considered significant. 

For comparison, Smad2 ChlP-seq peak coordinates were downloaded 
from NCBI GEO accession numbers GSM578474 and GSM578475, and 
Klf5 ChlP-seq peak coordinates were acquired from the online version of 
the work of Parisi et al. (50). Genes with proximal Smad2 and KlfS binding 
were identified as outlined above. The association between proximal 
binding and differentially expressed genes was calculated by chi-square 
test. 

Microarray experiments. ESCs were washed and directly lysed in 
TRIzol (Invitrogen) on culture plates. RNA was extracted and Turbo 



DNase (Invitrogen) treated according to the manufacturers' instructions. 
The RNA was then cleaned using an RNeasy minikit ( Qiagen) and hybrid- 
ized to lUumina Mouse WG-6 v2 Expression BeadChips as described pre- 
viously (4). Four biological replicates were performed for Prdm4'^^^^^^ 
cells, and six biological replicates each were performed for wild-type and 
stably transfected ESCs expressing fuU-length Prdm4-EGFP. 

Microarray data analysis. Differential expression was determined fol- 
lowing rank-invariant normalization by using the lUumina custom error 
model option with Benjamini and Hochberg false discovery rate. Probes 
with an lUumina DiffScore of >30, equivalent to a P value of < 1 X 10^^, 
were considered significant. Probes corresponding to genes in Ensembl 
release 67 were then compared to Prdm4 binding regions identified by 
ChlP-seq. Functional annotation analysis was performed using DAVID 
Informatics Resources 6.7 (51, 52). Enriched gene ontology biological 
process terms with Benjamini-Hochberg corrected? values of S2 X 10^^ 
were considered significant. 

qPCR. Quantitative PCR (qPCR) was performed as previously de- 
scribed (42) with the exception that expression was normalized to Actb 
rather than Hprt. Primer sequences used are shown in Data set SI in the 
supplemental material. 

Alkaline phosphatase assay. Cells were plated at 200 cells per well in 
6-well plates in the presence of LIE. Cells were then subsequently cultured 
in the presence or absence of LIE for 4 to 6 days and then stained with the 
alkaline phosphatase detection kit (MUlipore) per the manufacturer's in- 
structions. 

Immunofluorescence microscopy of embryonic stem cells. ESCs 
were plated on gelatin-coated coverslips and cultured for up to 3 days. 
Cells were fixed with 4% paraformaldehyde and permeabilized with 0.2% 
Triton X- 100 -phosphate-buffered saline (PBS) before blocking with 10% 
donkey serum with 1% bovine serum albumin (BSA) and 0.05% Tween 
20 in PBS. Primary antibodies used include goat anti-0ct4 (Santa Cruz; 
sc-8628) and rabbit anti-Nanog (Abeam; ab80892), foUowed by the ap- 
propriate conjugated secondary antibodies: anti-goat Alexa Fluor 488- 
conjugated antibodies and anti-rabbit Alexa Fluor 555-conjugated anti- 
bodies (Molecular Probes/Invitrogen). Coverslips were mounted with 
Vectashield mounting agent containing 4',6-diamidino-2-phenylindole 
(DAPI; Vector Laboratories; H-1200). Fluorescent images were captured 
with a Leica epifluorescence microscope. 

Microarray data accession numbers. The microarray and ChlP-seq 
data have been deposited in NCBI GEO with the accession numbers 
GSE46308 and GSE48372, respectively. 

RESULTS 

Prdm4 is widely expressed throughout the developing embryo 
and adult tissues. Unlike Prdml and Prdml4 transcripts selec- 
tively expressed in germ cells at early postimplantation stages, sin- 
gle-cell profiling experiments revealed that Prdm4 transcripts 
were present at roughly equivalent levels in both the somatic and 
germ cell lineages (53). Similarly here, whole-mount in situ hy- 
bridization experiments demonstrate ubiquitous Prdm4 expres- 
sion throughout the embryo at E6. 5 (Fig. lA). A striking exception 
that lacks expression is the ectoplacental cone (EPC), a derivative 
of the extraembryonic ectoderm. At E7.5 (Fig. IB) and slightly 
later at E9.0 (Fig. IC), we similarly observe uniformly strong ex- 
pression throughout the embryo proper. In the developing pla- 
centa, transcripts are readily detectable in the labyrinth layer, de- 
rived from the chorion and allantois, but absent in the 
spongiotrophoblast layer derived from the EPC (Fig. ID). 

Northern blot analysis demonstrates that Prdm4 mRNA is 
strongly expressed in ESCs and E9.5 embryos (Fig. IE), and sim- 
ilar to findings in the developing embryo, we also detect expres- 
sion in all adult tissues tested except the liver (Fig. IE). Reproduc- 
tive tissues, including the testes, ovaries, and uterus, show 
particularly high levels of expression. Next, we performed whole- 
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FIG 1 Prdm4 RNA expression analysis during embryonic and postnatal development. (A) Prdm4 whole-mount in situ hybridization at E6.5 shows widespread 
expression in all tissues, with the exception of the EPC. (B) At E7.5, Prdm4 expression is apparent throughout the embryonic and extraembryonic tissues. (C) At 
E9.0, Prdm4 is broadly expressed. (D) ISH of a midsagittal section through the forming placenta shows Prdm4 expression confined to the developing labyrinth. 
(E) Northern blot assay on ESCs, 4-week-old mouse tissues, and E9.5 embryo using a radiolabeled probe spanning the entire Prdm4 coding sequence. 28S and 
18S bands are shown as loading controls. (Fand G) Whole-mount ISH analysis ofE13.5 female and male gonads shows that Pr(jm4 is most prominently expressed 
in the MiiUerian duct and forming testis cords, respectively. (H) In the adult ovary, high levels of expression are present in all stages of oocyte maturation and in 
the corpus luteum. (I) Prdm4 transcripts are abundantly expressed in developing spermatozoa and supporting cells of the seminiferous tubules. Abbreviations: 
EPC, ectoplacental cone; VE, visceral endoderm; ExE, extraembryonic ectoderm; Ep, epiblast; Ch, chorion; Al, aUantois; NE, neuroectoderm; M, mesoderm; DE, 
definitive endoderm; Ht, heart; Sp, spongiotrophoblast layer; MA, maternal artery; Dec, decidua; Lab, labyrinth layer; ESC, embryonic stem cells; MD, Miillerian 
duct; TC, testis cords; Oo, oocytes; CL, corpus luteum; ST, seminiferous tubules; Sd, spermatids. 



mount and section in situ hybridization on pre- and postnatal 
gonads. Both the somatic and germ cell components of the devel- 
oping gonads of both sexes express Prdm4 transcripts (Fig. IF and 
G and data not shown). We observe the strongest signal in the 
Miillerian duct mesenchyme and in the condensing testis cords. In 
adult ovaries, robust expression of Prdm4 marks oocytes at all 
stages of their maturation as well as the somatic cells of the corpora 
lutea (Fig. IH). In the testis, Prdm4 expression in the seminiferous 
tubules marks all stages of the developing spermatozoa as well as 
the Sertoli cells (Fig. II). 

Prdm4 binds a tripartite recognition sequence in close prox- 
imity to TSSs. We performed ChlP-seq experiments using stably 



transfected ESCs strongly expressing full-length Prdm4-EGFP 
from the chicken beta-actin promoter in combination with a 
ChlP-quality anti-GFP monoclonal antibody (30) (Fig. 2). Dupli- 
cate experiments performed on two independent Prdm4-EGFP- 
expressing ESC clones identified a total of 627 Prdm4 binding 
regions (see Data set S2 in the supplemental material). Analysis of 
their genomic distribution relative to gene annotations (Fig. 3A), 
together with assessment of the absolute distances relative to the 
nearest transcription start site (TSS) (Fig. 3B), revealed a signifi- 
cant enrichment within 5 kb of transcription start sites in either 
direction (P = 8.3 X 10~^^). Visualization of the average binding 
profile around TSSs demonstrates a particular bias to binding 
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^PR/SET domain Qzinc knuckle QPseudo zinc finger ^Zinc finger ^His-HA ^EGFP 

FIG 2 Illustration of the Prdm4 cDNA-protein structural domain relationships. The Prdm4 transcript with numbered exons (top) is aligned with full-length 
Prdm4 protein and the recombinant and mutant proteins represented in this study. The protein that results from deletion of exons 9 to 11 (AZF) is shown, as well 
as the recombinant His-HA-tagged zinc finger domain protein used for SELEX and EMSA. Primer sites for cloning the zinc finger domain are shown as pink 
arrows. C-terminal EGFP (in outline) denotes tagged protein expressed in stably transfected ESCs, as used for ChlP-seq, Western blot analysis, and imaging. 
UTR, untranslated region. 



fractionally upstream of the TSS (Fig. 3C). Genes exhibiting prox- 
imal Prdm4 binding are shown in Data set S2 in the supplemental 
material. 

Next, we compared our data to known ci's- regulatory elements. 
In particular, we examined Ensembl Regulatory Features, com- 
prised of regions known to have histone modifications associated 
with functionally active chromatin, DNase I hypersensitivity 
peaks, and transcription factor binding sites described in pub- 
lished ChlP-seq experiments (54). These results demonstrate that 
59% (372/627) of our Prdm4 ChlP-seq peaks map within known 
regulatory regions (Fig. 3D; see also Data set S3 in the supplemen- 
tal material). Interestingly, 33% (204/627) occur within ERVl: 
RLTR23 repeat regions (Fig. 3D). In some cases, mouse-specific 
families of transposable elements have been implicated in remod- 
eling of the transcriptional circuitry in mouse ESCs, but the spe- 
cific role of these ERV1:RLTR23 repeat regions has yet to be elu- 
cidated (55). 

Finally, to gain further insights into these Prdm4 binding 
events, we performed functional annotation. GREAT analysis 
(Fig. 3E) revealed relatively few significant terms associated with 
Prdm4 binding. However, proximal Prdm4 binding was found to 
be enriched among genes governing cell proliferation, housekeep- 
ing genes, and Mss4-like genes. Structural studies suggest that 
Mss4 may act as a guanine nucleotide-free chaperone. However, 
the functional activities contributed by the Mss4-like domain re- 
main unknown (56-58). 

In order to identify the sequence bound by Prdm4, we per- 
formed de novo motif finding. We identified three highly similar 
tripartite sequences of the form GGGn(j_4)CTnGAAAC within 
the Prdm4 binding regions (Fig. 4A; see also Fig. SI in the supple- 
mental material). Nearly all (602/627, 96%) ChlP-seq peaks con- 
tain at least one significant match to one of these motifs (see Fig. 
SI). 

To strengthen these results, we generated recombinant Prdm4- 
ZFD and performed SELEX. A highly similar motif corresponding 
to the most conserved (GAAAC) region of sequence bound by 
Prdm4 was identified (Fig. 4A). We also performed EMSAs using 



the recombinant His-hemagglutinin (HA)-tagged Prdm4-ZFD. 
The control probe corresponding to the 50-bp central portion 
of the ChlP-seq peak upstream of Bahccl contains a strong match 
for the Prdm4 15-mer binding motif (Fig. 4B). Variants with three 
discrete motif components mutated individually and in combina- 
tion were tested to evaluate their relative contributions to Prdm4 
binding (Fig. 4B). The GAAAC component of the binding motif 
appears to be essential for Prdm4-ZFD binding, whereas the CT 
and GGGG components, as predicted by their weaker conserva- 
tion, modulate the strength of binding but are not individually 
required. 

Targeted deletion of the ZF domain results in a functional 
null allele. EMSAs demonstrate that a recombinant His-HA- 
tagged Prdm4-ZFD on its own is sufficient to mediate DNA bind- 
ing. Targeted deletion of the Prdm4 ZFD should therefore create a 
functional null allele. This strategy was successfully exploited to 
disrupt Prdml functional activity in vivo (8, 59). We engineered a 
targeted deletion spanning a 3.25-kb fragment (chrl0:85361568 
to 85364815, NCBI37 assembly, mm9) encompassing exons 9, 10, 
and 1 1 (Fig. 2 and 5A) that encodes 67 C-terminal amino acids of 
the PR/SET domain, the pseudo Zn finger at position 548 to 569, 
and 4 proximal Zn fingers within the ZFD (amino acid positions 
593 to 699) (Fig. 2) (Ensembl protein ENSMUSP00000041942). 
Correctly targeted cells were Cre excised to eliminate the PGK- 
Hygro selection cassette and subjected to retargeting to generate 
homozygous Prdm4^^^'^^^ ESCs. These doubly targeted clones 
appear morphologically indistinguishable from wild-type ESCs. 

Stably transfected Prdm4^^^^^ ESCs exclusively expressing 
either FL-Prdm4-EGFP or the Prdm4AZF variant at equivalent 
levels (Fig. 2 and 6A and B) were examined by confocal micros- 
copy (Fig. 6C). As expected, FL-Prdm4-EGFP-tagged protein pre- 
dominantly localizes to the nucleus, whereas Prdm4AZF-EGFP 
remains cytoplasmic. A similar conclusion was reached via West- 
ern blot analysis of nuclear and cytoplasmic fractions (Fig. 6D). 
Moreover, EMSAs demonstrate that nuclear extracts from 
Prdm4AZF-EGFP-expressing cells lack DNA binding activity 
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(Fig. 6E). A strong argument can therefore be made that the 
Prdm4AZF protein represents a functionally null variant. 

Prdm4 regulates Nodal and Klf5 expression upstream of key 
pluripotency and differentiation pathways. Next, we performed 



transcriptional profiling experiments comparing wild-type, stably 
transfected FL-Prdm4-EGFP-expressing ESCs and homozygous 
null Prdm4^^'''^^ ESCs. A large number of differentially ex- 
pressed transcripts were identified in wild-type ESCs compared 
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with Prdm4^^'^^ mutant ESCs (770 upregulated; 1,325 down- 
regulated; P < 1 X 10~^) (Fig. 7A). In contrast, relatively few 
differences were observed in comparing wild-type with stably 
transfected Prdm4-EGFP-expressing cells (23 upregulated; 15 



downregulated). Prdm4 loss of function therefore appears to have 
a substantial impact, whereas Prdm4 overexpression has a less 
striking effect on gene expression patterns. Interestingly, both 
down- and upregulated genes are significantly more likely to have 
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proximal Prdm4 binding than are all genes (Fig. 7B). These results 
strongly suggest that Prdm4 functions as both a transcriptional 
activator and a repressor. 

Our functional annotation analysis of the differentially ex- 
pressed genes revealed, among downregulated genes (GO: 
0045596, P = 1.4 X 10~^), significant enrichment for negative 
regulators of differentiation, including the key pluripotency genes 
PouSfl {Oct4), Nanog, and Bmp4 (Fig. 7C; see also Data set S4 in 
the supplemental material). Upregulated genes are significantly 
enriched for transcripts associated with embryonic development 
(GO:0043009, P = 1.7 X 10"^), including Nodal, Gli3, Gata6, 
Pitx2, and others (Fig. 7C; see also Data set S5 in the supplemental 
material). Interestingly, both Klf5 and Nodal contain proximal 
Prdm4 binding sites (Fig. 7C and D). 

Nodal, a member of the transforming growth factor p (TGF-p) 
superfamily of secreted growth factors, plays an essential role in 
anterior-posterior and left-right axis formation and mesodermal 



patterning in the early vertebrate embryo (60). Nodal/ Activin sig- 
naling regulates ESC differentiation in vitro (61-63) and functions 
to maintain pluripotency through regulation of Nanog (64). Sim- 
ilarly, the zinc finger transcription factor Klf5 is essential for ESC 
self-renewal (65) and functions redundantly in maintaining plu- 
ripotency by directly regulating expression of key target genes 
such as PouSfl (Oct4), Sox2, and Nanog (66). 

To test whether misregulated expression of Kip and Nodal 
accounts for the transcriptional shift observed in Prdm4^^^^^^ 
ESCs, we interrogated published Klf5 and Smad2 (the down- 
stream effector of Nodal signaling) ChIP data sets (50, 67). Genes 
with proximal Klf5 and Smad2 binding were compared to those 
differentially expressed in Prdm4^^'^^ ESCs. This exercise re- 
veals a highly significant association between occupancy by these 
transcription factors and genes misregulated in the absence of 
Prdm4. It is therefore tempting to speculate that Prdm4 may reg- 
ulate KlfS- and NociaZ-dependent pathways (Fig. 7E). Genes dif- 
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those with higher than tlie average intensity are red. (B) Genes either up- or downregulated in Prdm4 



' cells are significantly more likely to have Prdm4 binding 



within 5 kb of their TSS than are all genes. (C) qPCR shows that pluripotency markers are significantly downregulated in Prdm4^^"^^ cells and upregulated in 
Prdm4-EGFP-expressing cells (left) whereas differentiation markers are upregulated in Prdm4'^^'"^ cells (right). *,P< 1 X 10"^;**,P< 1 X 10"^;***, P< 1 X 10"^; 
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ferentially expressed in Prdm4'^^"^^ cells with proximal binding 
of Klf5 or Smad2 are shown in Data sets S6 to S9 in the supple- 
mental material. 

Next, to test whether this altered transcriptional signature 
could potentially influence ESC differentiation dynamics, we ex- 



amined Oct4 and Nanog expression comparing wild-type, ho- 
mozygous null Prdm4^^'^^ , and stably transfected wild-type 
ESCs expressing either full-length Prdm4-EGFP or Prdm4AZF- 
EGFP. Consistent with the results above, cells maintained in the pres- 
ence of LIE display robust expression of these pluripotency markers 
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(see Fig. S2 in the supplemental material). Cells were subsequently 
ctxltured without LIF and then stained for alkaline phosphatase activ- 
ity. As judged by morphological criteria, all the cells were induced to 
differentiate. However, a proportion of Prdm4-EGFP-overexpress- 
ing colonies retain ESC-Kke characteristics (see Fig. S2). Results of 
alkaline phosphatase staining similarly demonstrate that overexpres- 
sion enhances ESC self-renewal. In contrast, Prdm4-deficient ESCs 
display increased ESC differentiation abilities. Collectively, these re- 
sults support the idea that Prdm4 plays a regulatory role upstream of 
early developmental pathways. 

Prdm4 is nonessential for mouse development and fertility. 
To further explore Prdm4 functions in vivo, we generated mutant 
mice carrying the Prdm4'^^^ loss-of-function allele. Prdm4'^^\^^ 
ESCs were used to generate germ line chimeras via blastocyst in- 
jection. However, subsequent heterozygous intercross matings re- 
sulted in homozygous mutant progeny at the expected Mendelian 
ratios (Fig. 8A). Homozygous mutants were indistinguishable 
from wild-type and heterozygous littermates and displayed no 
overt abnormalities. 

To confirm that Prdm4 is dispensable for mouse development, 
we also generated a second targeted allele using the EUCOMM/ 
KOMP resource (68). Expression of the LacZ gene trap reporter 
allele faithfully recapitulates the endogenous Prdm4 expression 
pattern in all tissues examined, including the developing embryo 
(Fig. 9E to H) as well as prenatal and postnatal gonads (data not 
shown). Northern blot analysis of tissue from wild- type and ho- 
mozygous mice carrying the Prdm4^^ allele shows significant 
splicing around the lacZ cassette and production of the wild-type 
Prdm4 transcript (Fig. 9C). To obtain a null allele, heterozygotes 



carrying the Prdm4 ^ allele were crossed to Sox2.Cre partners to 
generate the Prdm4^'"^^ allele (Fig. 9A). This manipulation deletes 
exons 6 and 7, corresponding to amino acids 372 to 468 spanning 
the PR/SET domain, and results in a LacZ-Prdm4 fusion tran- 
script completely lacking in-frame coding information down- 
stream of the deletion (Fig. 9A). PCR genotyping confirmed the 
correct Sox2. Cre-mediated excision of the loxP site-flanked re- 
gion. As described above, heterozygous intercross matings yielded 
homozygous mutant progeny at the expected Mendelian ratios 
(Fig. 8A). Northern blot analysis of total RNA from brain and 
testes of wUd-type, heterozygous, and homozygous mutant mice 
confirmed the presence of the expected fusion transcript (Fig. 
9D). Adult homozygous mutants failed to display any phenotypic 
disturbances. The present analysis of two independently gener- 
ated Prdm4 null alleles demonstrates that Prdm4 is nonessential 
for mouse development. 

Prdm4 is robustly expressed in both the somatic and germ cell 
components of the developing gonads (Fig. IF to I), suggesting a 
potential role for Prdm4 in reproductive function. However, adult 
Prdm4^'"^^^'^^ and Prdm4^^^^^^^ homozygous mutants of both 
sexes (n = 4) test bred with wild-type partners proved fertile and 
gave rise to healthy litters. Consistent with this, both the ovaries 
and testes appear histologically normal and are indistinguishable 
from those of wild-type littermates (Fig. 8B). We found compara- 
ble numbers of developing follicles in the Prdm4^^^^^^ ovaries 
and no evidence for abnormal spermatogenesis in the mutant tes- 
tes. These findings demonstrate that Prdm4 is also dispensable for 
germ cell development and fertility. 
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DISCUSSION 

The Prdm gene family first appeared in metazoans and experi- 
enced a massive expansion in vertebrates (2). In some cases, func- 
tions appear to be well conserved. For example, Drosophila mela- 
nogaster hamlet and closely related mammalian Prdms 8, 12, and 



13 function downstream of the Notch-Hes pathway to control cell 
fate during neurogenesis (69, 70). On the other hand, striking 
species-specific differences have also been described. For example, 
Blimpl/Prdml is essential for specification of the germ cell lineage 
in mice but not in fish (7, 9). In fish, Blimpl/Prdml is required 
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during specification of the slow twitch muscle cell lineage and a 
subset of sensory neurons (71-73), but in contrast, Blimpl/Prdml 
has no known function in either muscle or neural crest lineages in 
mice (74). 

At the amino acid level, Prdm4 is highly conserved across 
mammals (96% sequence identity between mouse and rat, 94% 
sequence identity between mouse and human). Previous studies 
have described Prdm4 activities in the developing nervous system 
in rodents (32, 33, 36), but only limited information is available 
about its functional contributions elsewhere. Here, we document 
widespread Prdm4 expression throughout the developing mouse 
embryo from postimplantation stages onwards, and in nearly all 
adult tissues except the liver. Strong expression was observed in 
the reproductive organs, namely, the testes and ovaries. Develop- 
ing spermatozoa and maturing oocytes express abundant levels of 
Prdm4 transcripts. Nonetheless, we found that Prdm4 loss-of- 
function mutant mice develop normally, are healthy and fertile, 
and display no detectable phenotypic abnormalities. 

Prdm4 was initially cloned as a candidate tumor suppressor 
gene in human cancers (75) and characterized as a cytoplasmic 
effector molecule acting downstream of the p75 neurotrophin re- 
ceptor in response to nerve growth factor signaling (32). The pres- 
ent experiments confirm that nuclear localization depends on the 
Prdm4 ZFD (36), and as expected, Prdm4 DNA binding activities 
are mediated by its zinc fingers. In the case of Blimpl/Prdml, the 
zinc finger domain also serves as a binding interface for recruit- 
ment of epigenetic modifiers, namely, G9a and HDACl/2 (21). An 
alternative Aexon7 isoform, having a drastically truncated zinc 
finger domain, lacks DNA binding activity and fails to bind to G9a 
or HDACl/2 (41). Similarly here, the Prdm4AZF protein lacks the 
ability to tightly associate with chromatin inside the nucleus. As 
for the alternative Blimpl/Prdml Aexon7 isoform (41), the 
Prdm4AZF allele engineered here by homologous recombination 
in ES cells represents a loss-of-function mutation. 

The present SELEX experiments demonstrate that the ZF do- 
main is sufficient on its own to bind the Prdm4 tripartite consen- 
sus motif Complementary ChlP-seq experiments revealed that 
96% of target sites occupied by Prdm4 contain this consensus 
binding motif Thus, Prdm4 DNA binding is predominantly se- 
quence specific and independent of interactions with other tran- 
scription factors. Previous work suggested that Prdm4 functions 
as a transcriptional repressor through recruitment of HDACs 
(36). However, here integration of ChlP-seq and expression mi- 
croarray data reveals that both upregulated and downregulated 
genes display Prdm4 binding sites. These results strongly suggest 
that Prdm4 mediates both activation and repression of target 
genes. 

The cyclin E (CCNEl) gene was previously identified as a can- 
didate Prdm4 transcriptional target (36). Small interfering RNA 
(siRNA) knockdowns resulted in increased cyclin E expression, 
and downregulated expression of a luciferase reporter construct 
containing the CCNEl 1.4-kb regulatory region was observed in 
transient-transfection assays (36). However, the present ChlP-seq 
experiments provided no evidence for proximal Prdm4 binding to 
Ccnel in ESCs. The 1.4-kb CCNEl regulatory region (36, 76) lacks 
the Prdm4 consensus motif, and Ccnel expression was not found 
to be downregulated in stably transfected Prdm4-EGFP-express- 
ing cells. Surprisingly, we observe that Ccnel is significantly down- 
regulated in functional null Prdm4^^^'^^^ cells (? = 8 X lO"**). 
The CCNEl regulatory region does contain GAAAC — the mini- 



mal sequence motif identified in SELEX experiments. However, 
the larger tripartite motif present in 96% of ChlP-seq peaks and 
the broader motif elements identified in EMSAs taken together 
strongly suggest that GAAAC on its own is insufficient for Prdm4 
binding. One possible scenario is that Prdm4 has the ability to 
recognize this minimal sequence motif due to increased Prdm4 
expression levels and target availability in transiently transfected 
cells. However, we find the opposing effects, namely, Prdm4-me- 
diated repression of cyclin E expression versus decreased expres- 
sion seen here in Prdm4-deficient ESCs, very perplexing. Collec- 
tively, these observations suggest that cell-type-specific chromatin 
context could have a dramatic impact on transcriptional output at 
the Ccnel locus. 

Recent work suggests that Prdm4 regulates cell cycle progres- 
sion in neural stem cells (NSCs) (33). Thus, siRNA knockdown of 
Prdm4 cultured embryonic cortical NSCs led to precocious differ- 
entiation. Here, in the absence of LIE Prdm4^^^'^^ -nuW ESCs 
display increased differentiation abilities. However, Prdm4 loss- 
of-function mutant embryos develop normally, and adult ho- 
mozygous mutants display normal body and organ size and are 
fertile. Moreover, expression microarray analysis oiPrdm4'^'^^'^^ 
ESCs reveals misregulation of many key pluripotency genes. 
Genes with proximal Smad2 and KlfS binding significantly over- 
lap genes misreg ulated in Prdm4^^"^^^ ceUs. Both Nodal and KlfS 
contain proximal Prdm4 binding sites, strongly suggesting that 
Prdm4 functions upstream of Nodal and KlfS in the maintenance 
of pluripotency. 

Dose-dependent Nodal/Smad2/3 signaling plays essential roles 
in the early mouse embryo (77, 78). The strength of Nodal/ 
Smad2/3 signaling is tightly controlled by reciprocal feedback and 
feed-forward regulatory circuits between the embryo and extra- 
embryonic tissues (79). The n's-regulatory enhancer elements di- 
recting dynamic patterns of Nodal expression have been exten- 
sively analyzed via complementary transgenic analysis and 
targeted deletion strategies (77, 78, 80). Interestingly, the Prdm4 
binding site identified here lies near the intronic autoregulatory 
ASE (80). Recent studies demonstrate that continuous Nodal sig- 
naling actively recruits the histone demethylase Jmjd3 to the 
asymmetric as-regulatory element to counteract repression by 
PRC2 (81). Prdm4 binding potentially functions collaboratively 
with Smad2/3 to maintain Nodal expression and antagonize 
PRC2-mediated repression. 

KlfS is ubiquitously expressed in the preimplantation embryo, 
and in the trophectoderm lineage, KlfS is required to support im- 
plantation and expansion of the inner cell mass (ICM) (65). KlfS- 
deficient ESCs derived by sequential gene targeting precociously 
undergo differentiation, consistent with the idea that KlfS main- 
tains pluripotency networks (65). The regulatory elements that 
drive KlfS expression in the early embryo remain unknown. The 
ChlP-seq experiments presented here demonstrate that Prdm4 
occupancy overlaps with Esrrb, Ctcf and Tcfcp2ll binding sites 
(82), suggesting that this represents a key regulatory region driv- 
ing KlfS expression in ESCs. 

Recent experiments suggest that Prdml4 functions to ensure 
pluripotency in cultured ESCs (24) and maintain so-termed naive 
pluripotency in the ICM (83). Here, we observe that Prdml4 is 
significantly downregulated in functionally null Prdm4^^^^^ 
ESCs (Fig. 7C) (P = 2 X 10~").Arecent genome-wide analysisof 
bivalent chromatin marks associated with expression of develop- 
mental genes poised for activation revealed that the epigenetic 
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Status of extraembryonic ectoderm and visceral endoderm tissues 
isolated from the early embryo only partially overlaps with that 
seen in the corresponding trophoblast stem cells (TS cells) and 
extraembryonic endoderm (XEN) cell lines (84). Similarly, 
Prdml4 expression in the early embryo is strictly confined to the 
emerging primordial germ cells (PGCs), consistent with its essen- 
tial role in governing their specification (10). 

The present experiments demonstrate that Prdm4 regulates 
transcriptional output in ESCs. However, in vivo in the context of 
the developing embryo and adult mouse, Prdm4 is entirely dis- 
pensable. Additional work will be necessary to explore Prdm4 
functional contributions to tissue homeostasis, maintenance of 
hematopoietic stem cells, lymphocyte differentiation, and its pos- 
sible roles in cancer. We are especially curious to learn more about 
Prdm4 activities in the central nervous system. It will be important 
to evaluate whether, as in PrdmS mutant mice (85), neurogenesis 
in Prdm4^^^^^^^ mice may be compromised, leading to subtle 
behavioral abnormalities. Currently, both the Prdm4-LacZ gene 
trap and Prdm4^^^^^^^ functional null mutations are being main- 
tained on a mixed C57BL/6J:129 genetic background. Extensive 
backcrossing will be required before proper behavioral and phys- 
iological phenotyping studies can be undertaken. 
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